Integrated Analysis of Multiple Microarray Datasets Identifies a Reproducible Survival Predictor in Ovarian Cancer

Authors

  • Panagiotis A. Konstantinopoulos
  • Stephen A. Cannistra
  • Helen Fountzilas
  • Aedin Culhane
  • Kamana Pillay
  • Bo Rueda
  • Daniel Cramer
  • Michael Seiden
  • Michael Birrer
  • George Coukos
  • Lin Zhang
  • John Quackenbush
  • Dimitrios Spentzos
Abstract

BACKGROUND: Public data integration may help overcome challenges in clinical implementation of microarray profiles. We integrated several ovarian cancer datasets to identify a reproducible predictor of survival.

METHODOLOGY/PRINCIPAL FINDINGS: Four microarray datasets from different institutions comprising 265 advanced stage tumors were uniformly reprocessed into a single training dataset, also adjusting for inter-laboratory variation ("batch-effect"). Supervised principal component survival analysis was employed to identify prognostic models. Models were independently validated in a 61-patient cohort using a custom array genechip and a publicly available 229-array dataset. Molecular correspondence of high- and low-risk outcome groups between training and validation datasets was demonstrated using Subclass Mapping. Previously established molecular phenotypes in the 2nd validation set were correlated with high- and low-risk outcome groups. Functional representational and pathway analysis was used to explore gene networks associated with high- and low-risk phenotypes. A 19-gene model showed optimal performance in the training set (median OS 31 and 78 months, p < 0.01), 1st validation set (median OS 32 months versus not-yet-reached, p = 0.026) and 2nd validation set (median OS 43 versus 61 months, p = 0.013), maintaining independent prognostic power in multivariate analysis. There was strong molecular correspondence of the respective high- and low-risk tumors between training and 1st validation set. Low- and high-risk tumors were enriched for favorable and unfavorable molecular subtypes and pathways, previously defined in the public 2nd validation set.

CONCLUSIONS/SIGNIFICANCE: Integration of previously generated cancer microarray datasets may lead to robust and widely applicable survival predictors. These predictors are not simply a compilation of prognostic genes but appear to track true molecular phenotypes of good and poor outcome.
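The abstract names supervised principal component survival analysis without giving details. As a rough illustration only, one common formulation ranks genes by a univariate Cox score, keeps the top-ranked genes, and uses the first principal component of those genes as a risk score. The sketch below follows that outline on simulated data. The function names `cox_score` and `supervised_pc`, the fixed `n_genes` cutoff, and the simulated cohort are all assumptions made for illustration; they are not taken from the paper, and in practice the cutoff would typically be tuned by cross-validation.

```python
import numpy as np

def cox_score(X, time, event):
    """Univariate Cox score statistic (evaluated at beta = 0) per column of X."""
    order = np.argsort(time)
    X, time, event = X[order], time[order], event[order].astype(bool)
    # Reverse cumulative sums: totals over all samples at or after each index.
    s1 = np.cumsum(X[::-1], axis=0)[::-1]
    s2 = np.cumsum((X ** 2)[::-1], axis=0)[::-1]
    # First index whose time is >= each sample's time, i.e. start of its risk set.
    first = np.searchsorted(time, time, side="left")
    n_risk = (len(time) - first)[:, None]
    mean = s1[first] / n_risk
    var = s2[first] / n_risk - mean ** 2
    U = (X[event] - mean[event]).sum(axis=0)   # observed minus expected
    V = var[event].sum(axis=0)                 # score variance
    return U / np.sqrt(np.maximum(V, 1e-12))

def supervised_pc(X, time, event, n_genes):
    """Select top genes by |Cox score|, then take their first principal component."""
    Xc = X - X.mean(axis=0)
    z = cox_score(Xc, time, event)
    top = np.argsort(-np.abs(z))[:n_genes]
    _, _, vt = np.linalg.svd(Xc[:, top], full_matrices=False)
    loading = vt[0]
    score = Xc[:, top] @ loading
    # Orient the component so that a higher score means a higher hazard.
    if cox_score(score[:, None], time, event)[0] < 0:
        loading, score = -loading, -score
    return top, loading, score

# Simulated cohort (hypothetical): genes 0-4 share a latent prognostic factor.
rng = np.random.default_rng(0)
n, p = 300, 50
latent = rng.normal(size=n)
X = rng.normal(size=(n, p))
X[:, :5] = latent[:, None] + 0.3 * rng.normal(size=(n, 5))
t_event = rng.exponential(scale=1.0 / np.exp(1.5 * latent))
t_censor = rng.exponential(scale=2.0, size=n)
time = np.minimum(t_event, t_censor)
event = t_event <= t_censor

top, loading, score = supervised_pc(X, time, event, n_genes=5)
high_risk = score > np.median(score)   # median split into two risk groups
```

A median split of the score is only one way to define high- and low-risk groups; the sketch omits batch-effect adjustment, cross-validation, and validation on independent cohorts, all of which the abstract describes as part of the actual study.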

Similar resources

Survival rate of women with ovarian cancer in Fars Province, Iran

Introduction: Ovarian cancer is the 6th most common cancer and the 7th leading cause of cancer death in women worldwide. The aim of this study was to determine the factors related to survival of ovarian cancer among women in Fars Province, southern Iran. Methods: In a survival analysis study, the recorded data at the cancer registry center of Shiraz University of Medical Sciences related ...

Pathway-Based Classification of Cancer Subtypes

Molecular markers based on gene expression profiles have been used in experimental and clinical settings to distinguish tumors in stage, grade, survival time, metastasis and drug sensitivity. However, most significant gene markers are unstable (not reproducible) among data sets. We introduce a method representing cancer markers as 2-level hierarchical feature vectors, including stable pathway-b...

Meta-Analysis of Microarray Data Identifies GAS6 Expression as an Independent Predictor of Poor Survival in Ovarian Cancer

Seeking new biomarkers for epithelial ovarian cancer, the fifth most common cause of death from all cancers in women and the leading cause of death from gynaecological malignancies, we performed a meta-analysis of three independent studies and compared the results in regard to clinicopathological parameters. This analysis revealed that GAS6 was highly expressed in ovarian cancer and therefore w...

SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

In this paper, we propose a new gene selection algorithm based on the Shuffled Frog Leaping Algorithm, called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most biological datasets, such as cancer datasets, have a large number of genes and few samples. However, most of these genes are not usable in some tasks, for example in cancer classification....

FXYD5 is a Marker for Poor Prognosis and a Potential Driver for Metastasis in Ovarian Carcinomas

Ovarian cancer (OC) is a leading cause of cancer mortality, but aside from a few well-studied mutations, very little is known about its underlying causes. As such, we performed survival analysis on ovarian copy number amplifications and gene expression datasets presented by The Cancer Genome Atlas in order to identify potential drivers and markers of aggressive OC. Additionally, two independent...

Journal title:

Volume 6, Issue

Pages -

Publication date: 2011